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(57) Abstract 

Processes for conjupting proteins with polyctliylcne glycol are disclosed. The disclosed processes provide modified proteins having 
little or no decrease in tiieir activity and include the steps of deleting at least one amino acid residue on the protein, replacing the at least one 
amino acid rcssidue widi an amino acid residue that docs not react with polyethylene glycol, and contacting the protein with polyethylene 
glycol under conditions sufficient to conjugate the polyethylene glycol to the protein. This advantageous retention of a desired protein 
activity is attributed to the availability of one or more protein binding sites which is unaltered in the conjugation process and thus remains 
free to interact with a binding partner ligand or cognate subsequent to the conjugation process. 
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TITLE 

SITE SPECIFIC PROTEIN MODIFICATION BY MUTAGENESIS 

5 

BACKGROUND OF THE INVENTION 

Field of the Invention 

The present invention relates to processes for modifying proteins. More 
10 . particularly, the present invention involves processes for linking polyethylene glycol to 
proteins in a manner which provides advantages associated with polyethylene glycol 
conjugated proteins while maintaining a desired protein bioactivity. 



Description of Related Art 

15 Processes and reagents for chemically modifying proteins have been used 

extensively for decades. Traditionally, protein chemical modifications were carried out in 
order to study their functional properties and structural characteristics. With the 
emergence of recombinant DNA techniques and interest in protein therapeutics, 
researchers have chemically modified proteins to improve their clinical performance. In 

20 particular, processes for conjugating proteins with polyethylene glycol have gained 
widespread use within the pharmaceutical and biochemical communities as a result of 
numerous improved pharmacological and biological properties associated with 
polyethylene glycol conjugated proteins. For example, polyethylene glycol conjugated 
proteins are known to have significantly enhanced plasma half life, and thus have 

25 substantially improved the cUnical usefulness. Additionally, polyethylene glycol 

conjugated proteins generally have reduced antigenicity and inununogenicity, thereby are 
less prone to causing life-threatening anaphylaxis. 

Another benefit associated with polyethylene glycol conjugated proteins is that of 
water solubility which is increased as a result of the high water solubility of polyethylene 

30 glycol. The increased water solubility can improve the protein's formulation 

characteristics at physiological pH's and can decrease complications associated with 
aggregation of low solubility proteins. 

Additionally, polyethylene glycol conjugated proteins have found use in 
bioindustrial applications such as enzyme based reactions in which the reaction 

35 environment is not optimal for the enzyme's activity. For example, some polyethylene 
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glycol conjugated enzymes demonstrate a wider optimum pH activity and reduced 
optimum activity temperature. Moreover, enzymes having reduced activity in many 
organic solvents have been successfully conjugated with polyethylene glycol to a degree 
that renders them useful for catalyzing reactions in organic solvents. For example, 
5 polyethylene glycol has been conjugated with horseradish peroxidase which then becomes 
soluble and active in chlorofonn and toluene (Urrotigoity et al., Biocatalysis, 2:145 - 149, 
1989). 

Polyethylene glycol conjugated proteins vary in the extent to which plasma 
circulation half life is increased, immunogenicity is reduced, water solubility is enhanced, 

10 and enzymatic activity is improved. Factors responsible for these variations are numerous 
and include the degree to which the protein is substituted with polyethylene glycol, the 
chemistries used to attach the polyethylene glycol to the protein, and the locations of the 
polyethylene glycol sites on the protein. 

The most common methods for attaching polyethylene glycol to proteins involve 

IS activating at least one of the hydroxyl groups on the polyethylene glycol with a 

functionality susceptible to nucleophilic attack by the nitrogen of amino groups on the 
protein. These methods generally result in loss of biological activity due to the 
nonspecific attachment of polyethylene glycol 

Alternative approaches to conjugating proteins with polyethylene glycol include 

20 controlling the conjugation reactants and conditions so that the conjugation site is 
confined to the N-terminus (Kinstier et al. Pharm. Res, 13:996, 1996); attaching 
polyethylene glycol to protein carbohydrate functionalities (Urrutigoity et al. Biocatalysis 
2:145, 1989); attaching polyethylene glycol at protein cysteine residues (Goodson et al. 
Biotechnology 5:343, 1990); attaching polyethylene glycol during solid phase and 

25 solution phase peptide synthesis (Felix, ACS Symposium Series 680 ch 16, 1997) and, 
selectively replacing protein arginine residues with lysine residues that provide an 
polyethylene glycol attachment site (Hershfield et al. Proc. Natl. Acad Sci, SS:7185, 
1991). While these offer some degree of control of the reaction site, there is a continuing 
need for improved methods for providing polyethylene glycol conjugated proteins. In 

30 particular, it would be desirable to provide methods for conjugating proteins with 

polyethylene glycol that result in modified proteins having enhanced bioactivity or littie 
loss in bioactivity while maintaining the benefits of polyethylene glycol conjugation, 
including substantially decreased immunogenicity, increased solubility, and prolonged 
circulation half lives characteristic of modified proteins. 
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SUMMARY OF THE INVENTION 
The present invention provides protein modification processes that result in 
modified proteins having little or no decrease in an activity associated with the protein. 
S More particularly, the invention described herein includes processes for modifying a 
protein by first deleting one or more amino acid sites on the protein that is suitable for 
polyethylene glycol conjugation and then contacting the protein with polyethylene glycol 
under conditions suitable for conjugating the polyethylene glycol to the protein. 
Preferably, the deleted amino acid residue is replaced with an amino acid residue that 

10 does not react with polyethylene glycol. The resulting polyethylene glycol conjugated 
protein has improved characteristics over proteins modified according to prior art 
procedures. An advantageous retention of activity is attributed to the availability of one 
or more protein binding sites which is unaltered in the protein modification process and 
thus remains free to interact with a binding partner subsequent to the modification 

15 process. Within the present invention are proteins useful for polyethylene glycol 

conjugation and polyethylene glycol conjugated proteins prepared by processes described 
herein. 



BRIEF DESCRIPTION OF THE DRAWINGS 

20 Fig. 1 illustrates lysine residues within p75 TNF receptor extracellular domain that are 
polyethylene glycol conjugation sites and lysine residues that make contact with TNFa. 

DETAILED DESCRIPTION OF THE INVENTION 
The present invention provides processes and reagents for conjugating proteins or 

25 polypeptides with polyethylene glycol in a manner that results in polyethylene glycol 
conjugated proteins having little or no reduction in a desired activity. More specifically, 
the present invention provides processes for conjugating polyethylene glycol with 
proteins under conditions which preclude polyethylene glycol conjugation at one or more 
selected sites on the protein. When the one or more selected sites is active in a protein 

30 binding domain, preventing polyethylene glycol conjugation at the site in accordance with 
the present invention contributes to maintaining a desired bioactivity while providing 
benefits associated with polyethylene glycol conjugation. 



3 
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The processes of the present invention are based upon the discovery that by 
deleting one or more selected amino acid residues that are capable of reacting with 
polyethylene glycol sites, and then conjugating the protein with polyethylene glycol, the 
resulting polyethylene glycol modified protein does not demonstrate a significant 
5 reduction in a desired activity. In one embodiment, the selected amino acid residue is a 
lysine residue that, if reacted with a polyethylene glycol, interferes with the ability of the 
resulting conjugated protein to bind with its binding partner, substrate, or receptor. It is 
believed that the selected amino acid residues are associated with binding sites, and, if 
modified, interfere with the conjugated protein's structural elements that determine 

10 protein conformation and function. By deleting the selected amino acid residue, 

polyethylene glycol does not modify the protein at the site of the selected amino acid 
residue during a subsequent polyethylene glycol modification reaction. Preferably, in 
order to preserve the number of amino acid residues and maintain the optimum protein 
conformation, the deleted amino acid residue is replaced with an amino acid residue that 

15 is not reactive with polyethylene glycol under the reaction conditions. For example, 
lysine can be deleted and replaced with an arginine residue. Arginine has the same 
structure as lysine, with the exception of the polyethylene glycol reactive e-NHj 
functionality on lysine which is absent in arginine. 

Any protein is suitable for polyethylene glycol modification in accordance with 

20 the present invention including but not limited to protein ligands, receptors, antigens, 

antibodies, enzymes, protein fragments, peptides, and polypeptides. Particularly desirable 
protein candidates for polyethylene glycol modification as described herein are those 
which, subsequent to their modification by prior art methods, demonstrate a reduction in a 
desired activity. Other proteins which are suitable for modification in accordance with 

25 the present invention are those having multiple binding sites. In this embodiment, a 
protein may be conjugated with polyethylene glycol so that an activity associated with 
one or more of the multiple binding sites can be reduced while maintaining an activity 
associated with one or more different binding sites. This is accomplished by deleting one 
or more selected amino acid residues that are associated with binding sites for which 

30 activity is to be maintained and which are capable of reacting with polyethylene glycol, 
and leaving amino acid residues associated with binding sites for which activity is to be 
reduced for subsequent polyethylene glycol conjugation. Preferably, the deleted amino 
acid residue or residues is replaced with an amino acid residue that is not reactive with 
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polyethylene glycol under the reaction conditions. Additionally, the deleted amino acid 
residue or residues is replaced with an amino acid residue that does not significantly 
diminish the activity of the native protein. The resulting polyethylene glycol conjugated 
protein will have an activity associated with selected binding sites and, depending upon 
5 the degree to which additional sites are involved in the conjugation process, will have a 
diminished, or no activity, associated with such additional sites. This approach is useful 
in cases in which cognate or substrate binding to one or more protein binding sites is 
desirably suppressed in certain clinical, diagnostic or industrial applications. 

Proteins that may be modified in accordance with the present invention include 

10 those having utility in clinical and diagnostics applications and those used in the 
biotechnology industry, such as enzymes in bioreactors. Receptors which may be 
modified as taught herein include cytokine receptors, for example, TNFR, IL-4R, IL-IR, 
IL-17R, IL-15R, p55 TNFRtFc and p75 TNFRrFc . Candidate antibodies for conjugation 
include but are not limited to OKT3 (anti-T-Cell), AVAKINE™ (anti-TNF) and anti 

15 Her2/Neu. Enzymes of interest for conjugation include CD39. tPA. and DNAse. Many 
proteins have multimeric binding sites and require more than one association for activity. 
Such proteins are particularly desirable for modification since loss of one binding site 
leaves the whole protein inactive. Members of the group of multimeric proteins include 
TNF, hGH, QD40L, and FasL. Other candidate protein ligands are known to bind 

20 multiple receptor subunits and include IL-2, E^-IS, GM-CSF, and G-CSF. 

In accordance with the present invention, a selected amino acid residue is one that 
is associated with a site on a protein which contributes to a specific function of that 
protein, and which is reactive with polyethylene glycol under the protein modification 
reaction conditions. The selected amino acid residue may be directly involved with a 

25 binding association with a protein binding partner. Alternatively, the amino acid may be 
sufficiently central to the spatial configuration of the protein that modifying the protein 
with polyetfiylene glycol results in a significant loss of desirable properties even though 
the amino acid residue is neither within a binding site nor directly or physical involved 
with the protein's interaction with a binding partner. Sites include, but are not limited to, 

30 cognate sites or substrate binding sites which arc associated with a protein activity. 

Amino acid residues that are reactive with polyethylene glycol under conditions 
known in the art include those having residues having nucleophilic moieties that are 
available for reaction with polyethylene glycol or an activated polyethylene glycol. For 
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example, lysine is reactive with polyethylene glycol through its e-NH,; aspartic acid and 
glutamic acid are reactive with polyethylene glycol through their COOH (carboxyl) 
functionalities; serine and threonine are potentially reactive through their OH (hydroxyl) 
sites; and, cysteine with available SH (sulfhydryl) groups may also react with 
5 polyethylene glycol. Conditions suitable for reactions between polyethylene glycol or 
activated polyethylene glycols and specific amino acid residues in proteins are known and 
those skilled in the art are charged with knowledge such reactions. It is known in the art 
that lysine residues react with activated polyethylene glycol under favorable reaction 
conditions and with minimum side reactions. Thus, in accordance with the present 

10 invention, lysine residues are typically the targeted residue and the reaction conditions are 
controlled to maximize the reaction between polyethylene glycol and lysine. 

Determining a suitable amino acid residue to select for deletion and, preferably, 
replacement, can be accomplished using a number of different techniques. In cases where 
the three dimensional structure and epitopes or structural elements that determine protein 

15 function are not known, one method involves using site directed mutagenesis techniques 
to empirically determine amino acid residues that are associated with a site on a protein 
which contribute to a specific function of that protein. More particularly, one or more 
predetermined polyethylene glycol reactive amino acid residues on the protein can be 
deleted and preferably replaced with non reactive amino acid residues using mutagenesis 

20 and recombinant DNA methodologies. Conjugating the thus modified protein with 

polyethylene glycol and then testing the resulting polyethylene glycol conjugated protein 
for activity and other relevant properties provides valuable information relating to the 
suitability of the predetermined amino acid residue or residues for deletion and 
replacement. Sequentially repeating the above described process for different 

25 polyethylene glycol reactive amino acid residues will provide more complete information 
relating to the role of the deleted amino acid residue in determining the function and 
activity of the protein. For example, if a protein hais 8 lysine residues, DNA encoding the 
protein can be mutated in a site directed manner to produce a number of different mutants 
with one or more of the codons coding for the lysine residues replaced with codons 

30 coding for an arginine residue. The specific lysine coding codons that are mutated can 
include one selected codon, all of the lysine coding codons, are any permutation of the 
lysine coding codons, including the simultaneous mutagenesis of DNA coding lysine 
residues that are adjacent to each other. 
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After expressing, collecting and purifying the engineered proteins encoded by the 
mutated DNA, the expressed proteins can be reacted with polyethylene glycol to form a 
conjugated protein. Then the conjugated protein can be tested for functional activity and 
other characteristics such as immunogenicity, physiological clearance, and solubihty. 
5 The polyethylene glycol conjugated proteins that have the desired activity and most 
favorable clearance, solubility and inmiunogenicity properties also contain the desired 
selected lysine residues i.e., the residues that had been deleted and replaced prior to 
reacting the protein with polyethylene glycol. 

For many proteins, the location of polyethylene glycol reactive amino acid 

10 residues and their conformational contribution to the structure and function of the protein 
are known. Among these proteins, are those for which the crystalline structure of the 
protein is known, and, in some cases, the crystalline structure of the protein-binding 
partner complex is known. For these proteins, determining a selected amino acid residue 
typically requires only identifying the residues that are within the protein's binding 

15 domain or in close spatial proximity to the protein's binding region and identifying those 
residues that are reactive with polyethylene glycol under the contemplated polyetiiylene 
glycol reaction conditions. 

In accordance with the present invention, deleting a selected amino acid residue on 
the protein can be accomplished with a variety of suitable procedures that provide 

20 modified proteins. In the context of the present invention, such procedures include, but 
are not limited to, site directed mutagenesis techniques and direct protein synthesis 
methods in which the protein lacking one or more selected amino acid residues is 
synthesized using standard protein synthesis procedures known in the art. As noted 
above, preferably the process of deleting a selected amino acid residue additionally 

25 involves replacing the selected amino acid residue with an amino acid residue that is not 
reactive with polyethylene glycol. 

Proteins may be prepared by any of a number of conventional techniques. A 
desired DNA sequence may be chemically synthesized using techniques known per se. 
DNA fragments also may be produced by restriction endonuclease digestion of a full 

30 length cloned DNA sequence, and isolated by electrophoresis on agarose gels. Linkers 
containing restriction endonuclease cleavage site(s) may be employed to insert the desired 
DNA fragment into an expression vector, or the fragment may be digested at cleavage 
sites naturally present therein. 
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Alterations of amino acid sequence, including deleting selected amino acid 
residues and replacing the deleted residues with a different residue, may be accomplished 
by any of a number of conventional methods. Mutations can be introduced at particular 
loci by synthesizing oligonucleotides containing a mutant sequence, flanked by restriction 
5 sites enabling ligation to fragments of the native sequence. Following ligation, the 
resulting reconstructed sequence encodes an analog having the desired amino acid 
insertion, substitution, or deletion. 

Alternatively, oligonucleotide-directed site-specific mutagenesis procedures can 
be employed to provide an altered gene wherein predetermined codons can be altered by 

10 substitution, deletion or insertion. Exemplary methods of making the alterations set forth 
above are disclosed by Walder et al. (Gene 42:133, 1986); Bauer et al. (Gene 37:73, 
1985); Craik {BioTechniques, January 1985, 12-19); Smith et al. {Genetic Engineering: 
Principles and Methods^ Plenum Press, 1981); Kunkel {Proc. Natl Acad, Scl USA 
82:488, 1985); Kunkel et al. {Methods in EnzymoL 154:367, 1987); and U.S. Patent Nos. 

15 4,5 1 8,584 and 4,737,462 all of which are incorporated by reference. 

Similarly, the present invention provides methodologies for preventing multimeric 
association of proteins. For example, polyethylene glycol can be selectively conjugated 
onto sites in or around the multimeric association interface, while preserving the binding 
of the protein for its natural cognate through "site protected" polyethylene glycol 

20 conjugation as taught herein, thus preventing receptor multimerization. 

After preparing an altered protein having at least one selected amino acid residue 
that is deleted and preferably replaced with an amino acid residue that does not react with 
polyethylene glycol under the chosen reaction conditions, the altered protein is conjugated 
with polyethylene glycol. Reagents and procedures for forming polyethylene glycol- 

25 protein conjugates are known in the art per se and are generally applicable to the practice 
of the present invention. Typically, these procedures involve first providing an activated 
polyethylene glycol in which one or both hydroxyl groups on a polyethylene glycol are 
activated, and reacting the activated polyethylene glycol with active sites on a protein 
selected for polyethylene glycol conjugation. As mentioned above, the most widely 

30 utilized procedures for conjugating a protein with polyethylene glycol are based upon a 
nucleophilic reaction between protein amino sites (the e-amine nitrogen of lysine or the a- 
amino terminal amine) and an activated hydroxyl of polyethylene glycol. Since 
sulfhydryls are also nucleophiles, cysteine sulfhydryls that are not part of a disulfide 

8 
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bridge are also potential reaction sites on the protein. The general principles of 
polyethylene glycol conjugation with proteins, and common activating reagents are 
described by Delgado et al. in The Uses and Properties of PEG-Iinked Proteins, from 
Critical Reviews in Therapeutic Drug Carrier Systems, 9(3,4):249-304 (1992) and the 
5 ACS Symposium Series 680 ed. y Hanis et al., Poly(ethylene glycol) Chemistry and 
Biological Applications 1997, both of which are incorporated herein by reference. 

Activated forms of polyethylene glycol and monomethoxypolyethylene glycol are 
conmiercially available and may be used in processes of the present invention . Most 
notably, Shearwater Polymers, Inc of Huntsville, AL provides a number of polyethylene 

10 glycol polymers and polyethylene glycol derivatives. The Shearwater Polymers, Inc 
Catalog (Shearwater Polymers, Inc. Catalog Functionalized Biocompatible Polymers for 
Research, 1997-1998 incorporated herein by reference) describes and makes available a 
wide variety of activated polyethylene glycols suitable for coupling with proteins under a 
wide range of reaction conditions. This catalog additionally provides preferred reaction 

15 conditions for their derivatized polyethylene glycol reagents. Those skilled in the art 
having been made aware of the numerous reagents suitable for conjugating proteins with 
polyethylene glycol will appreciate the variety of reagent choices in view of the nature of 
the protein selected, the nature of the reactive amino groups or sulfhydryl groups on the 
protein and the end use of the conjugated protein. For example, to provide conjugated 

20 proteins having improved solubility, activity characteristics and delivery properties but 
not necessarily increased clinical clearance time, a succinimidyl succinate activated 
polyethylene glycol (SS-PEG) can be used in the conjugation reaction. The ester link to 
the protein is less stable and will hydrolyze in vivo , releasing the polyethylene glycol 
from the protein. Activated polyethylene glycols are available which will more 

25 preferentially react with amino groups as opposed to sulfhydryl groups and vice versa. 
Commonly selected activated polyethylene glycols include succinimidyl carbonate 
activated polyethylene glycols, succinimidyl succinate activated polyethylene glycol and 
succimidyl propionic acid polyethylene glycols. 

As an alternative to selecting commercially available activated polyethylene 

30 glycols, a polyethylene glycol of interest may be activated using reagents which react 
with hydroxyl functionalities to form a site reactive with a site on a protein of interest. 
Typically, the protein reactive site is an amino group but can be a sulfhydryl or hydroxyl 
and the activated polyethylene glycol typically is an active ester or imidizole (See pgs 274 
- 285 ibid.) Preferably, only one hydroxyl functionality of the polyethylene glycol is 
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activated which can be accomplished by utilizing a monomethoxypolyethylene glycol in 
an activating reaction. However, processes in which two hydroxyls are activated are 
within the scope of the present invention. Depending upon the nature of the activating 
group and the nucleophilic attack, the activating moiety may or may not become 
5 incoiporated into the protein following the nucleophilic reaction. 

The polyethylene glycol may be of any molecular weight but is preferably in the 
range of about 500 to about 100,000 and more preferably in the range of 2,000 to 20,000. 
The criteria for selecting a specific polyethylene glycol molecular weight include, but are 
not limited to, the molecular weight of the protein selected for modification, the charge on 

10 the protein, type of protein and the number and location of potential sites for conjugation. 
Immunological and plasma half-life characteristics of proteins conjugated with different 
molecular polyethylene glycols molecular weight are discussed in Delgado et al, Critical 
Reviews in Therapeutic Drug Carrier Systems, 9 :249, 1992 and the ACS Symposium 
Series 680, Harris et al. Poly(ethylene glycol) Chemistry and Biological Applications, 

15 1997. As known in the art, in general, the greater the amount of polyethylene glycol 
conjugated to the protein, the longer the plasma half-life and the greater the protein 
solubility. Since the molecular weight cut-off for glomerular filtration is roughly 70kDa, 
proteins having molecular weights less than about 70kDa will experience lengthened 
plasma half-life. For proteins larger than 70kDa, the effects of the polyethylene glycol 

20 and its molecular weight will vary with its clearance mechanism. 

In general, using a polyethylene glycol having a high molecular weight in the 
processes of the present invention results in conjugated proteins having more 
polyethylene glycol per molecule of protein than using polyethylene glycol having a 
lower molecular weight. Thus, when a high amount of polyethylene glycol per protein 

25 molecule is desirable, the molecular weight of the polyethylene glycol is preferably up to 
20,000. However, smaller molecular weight polyethylene glycols, because of their greater 
solution mobility, may conjugate to more sites on the protein than a higher molecular 
protein. Thus, when a protein has a number of desired conjugation sites it may be 
preferable to use a polyethylene glycol having a lower molecular weight to assure that an 

30 optimum number of sites is conjugated. This may be a particulariy desirable approach 
when the potential conjugation sites or reaction site on the protein are in close proximity 
to each other. Another consideration used in selecting a polyethylene glycol molecular 
weight is that even though proteins treated in accordance with the present invention have 
protected sites, larger molecular weight polyethylene glycols may be so large that, once 

10 
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conjugated, their molecular size causes them to extend their spacial or steric influence so 
that binding or receptor sites have reduced accessibility. It is within the knowledge of 
those skilled in the art to determine an optimum polyethylene glycol molecular weight for 
any selected protein and benefits desired from the polyethylene glycol conjugation. 
5 While the above described polyethylene glycol conjugation procedures are those 

in which the result is polyethylene glycol conjugated to protein via a covalent bond, it is 
within the scope of the present invention to include procedures in which the conjugation 
is via a different association. In the context of the present invention, proteins may be 
modified by conjugating them to polyethylene glycol using a variety of different linking 

10 or conjugating mechanisms. For example, a protein selected for conjugation can be 
derivatized at an amino group or other suitably reactive functionality with a polyA 
oligonucleotide and then conjugated with a polyeUiylene glycol derivatized with a polyT 
oligonucleotide. Another approach involves derivatizing the protein with a functionality 
having a known specific binding partner and then conjugating the protein with 

15 polyethylene glycol which has been derivatized with die binding partner for the 

functionality. For example, a protein can be derivatized with biotin and the polyethylene 
glycol derivatized with streptavidin or avidin (or vice versa). This results in Uie specific 
binding of polyethylene glycol to those protein sites having the biotin. A number of 
reagents for modifying proteins for the purpose of introducing certain functionalities are 

20 commercially available. For example, the Pierce ImmunoTechnology catalogue identifies 
and provides access to a variety of reagents associated with protein modification. Among 
these are Traut's Reagents and SATA (Pierce ImmunoTechnology Catalogue, Vol I, pg 
E-14) which can introduce active groups at N-terminal amines and lysine amino 
functionalities. These active groups provide sites for further introducing functionalities 

25 for reacting more specifically with polyethylene glycol. Those skilled in the art will also 
recognize that ionic interactions between polyethylene glycol and a protein of interest are 
also possible. For example, an association between an ionic moiety on the protein and its 
counter ion on polyethylene glycol can be utilized if the association is sufficientiy strong 
to remain associated under physiological conditions. 

30 Further embodiments of the present invention which may utilize prior modified 

proteins include those processes in which the protein selected for conjugation has too few 
potential polyethylene glycol conjugation sites or no potential polyethylene glycol 
conjugation sites outside the protected amino acid region. By modifying the selected 
protein to introduce amino and sulfhydryl sites on the protein sufficient polyethylene 
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glycol may be conjugated to the selected protein to provide the desired benefits. 
Modifying the selected protein can be achieved using genetic engineering methodologies 
or chemical modification. As mentioned above, processes and reagents for modifying 
proteins to achieve a large variety of desired results are well known in the art. In 
5 particular, in Wong, Chemistry of Protein Conjugation and Cross-linking y CRC Press, 
1993, incorporated herein by reference, provides information relating to conjugation 
reagents and process conditions. 

While polyethylene glycol is a preferred protein conjugating reactant, a variety of 
additional polymer modifiers have been used to modify proteins. These include modified 

10 polyethylene glycols, branched polyethylene glycols, crosslinked polyethylene glycols, 
dextrans. polyvinylpyrrolidone, polyvinylalcohol, polyamino acids, albumin and gelatins. 
Those skilled in the art will appreciate, once having an understanding of the present 
invention, that the principles and methods described herein can be applied to processes for 
modifying proteins with any of these additional reagents. 

IS Proteins modified according to the procedures described herein have benefits 

associated with polyethylene glycol conjugation without the expected significant loss in 
activity. By merely applying known testing procedures to establish post conjugation 
activity, the benefits to proteins conjugated in accordance with the present invention can 
be demonstrated. Activity tests are specific for the protein and should be selected 

20 according to the protein of interest. Many proteins have more than one site associated 
with one or more activities The choice of activity for measurement for such proteins 
depends upon the activity of interest and the site which is specifically selected for the 
amino acid residue deletion and subsequent conjugation reaction. In addition to 
evaluating polyethylene glycol conjugated proteins for their activity, they can be analyzed 

25 for the degree of polyethylene glycol substitution, molecular weight, and sites of 

conjugation. Techniques for performing these analytical procedures are well known and 
some are described with respect to polyethylene glycol conjugated proteins in Critical 
. Reviews in Therapeutic Drug Carrier Systems, 9(3:4):2S5 - 291. 1992. Example 4-6 
describe exemplary methods for characterizing polyethylene glycol conjugated proteins. 

30 In addition to providing compounds having improved bioactivity characteristics, 

the processes of the present invention provide polyethylene glycol conjugated molecule 
product that is more homogeneous and in higher yields. Because conjugation will not 
take place at amino acid residues that are critical to the molecule^s bioactivity, the 
reaction product need not be purified by cutting out numerous unwanted product 
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fractions. Because the polyethylene glycol reaction can be taken to completion and all the 
available polyethylene glycol sites can be fully reacted, the final product is more 
homogeneous than prior art products which are prepared under conditions that favor 
reaction at specific sites. 

The following examples are presented in order to provide a more detailed 
description of specific embodiments of the present invention and are not to be construed 
as limiting the scope of the invention. 

EXAMPLE 1 

Selecting a Protein Modification Site 

The following describes a procedure for identifying amino acid residues of p75 

TNF receptor for deletion and substitution in accordance with the present invention. 
Because the expected polyethylene glycol modification reaction conditions were to be 
those that favor modification of the e-amino group of lysine residues and the N-terminal 
amine, the amino acids identified were lysine residues that make contact between the TNF 
receptor and the ligand in the TNF receptor-ligand complex. 

The p75 TNF receptor is from a family of structurally homologous receptors 
which includes the p55 TNF receptor. TNFa and TNFP (TNF ligands) compete for 
binding to the p55 and p75 TNF receptors. The x-ray crystal structure of the complex 
formed by the extracellular domain of the human p55 TNF receptor and TNFP has been 
determined (Banner et al. Cell 75:431, 1993, incorporated herein by reference). This 
crystallography work confirmed that the complex of p55 TNF receptor and TNFp has 
three p55 TNF receptor molecules bound symmetrically to one TNFP trimer. The studies 
further demonstrated that the receptor binds in a groove between two adjacent TNFP 
subunits. Advantageously, the crystal structure of the complex provides a model for TNF 
receptor structure and activation and can be used to identify amino acid domains within 
the ligand and in the receptor that make contact to for the complex. 

A sequence alignment of the p55 TNF receptor amino acid sequence and the p75 
TNF receptor amino acid sequence reveals that p75 TNF receptor residues K34. K42, 
K47. K108. K120. and K140 are closely aligned with p55 TNF receptor residues K32. 
Y40, G45, S108. L119 and T138. (See Banner et al. Cell 75:431. 1993). Based upon this 
alignment information and molecular modeling that illustrates the spatial positions of 
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lysine residues on p75 TNF receptor, it can be seen that two lysine residues on the p75 
receptor make contact between the p75 receptor and ligand. These lysine residues are 
K108 and K120 (the lysine at position 108 and the lysine at position 120). Fig. 1 
provides an amino acid sequence of the extracellular domain of p75 TNF receptor 
5 (without the signal sequence) and illustrates lysine residues that are polyethylene glycol 
conjugation sites and lysine residues that make contact with TNFa. Thus, the lysine 
residues at positions 108 and 120 were selected for deletion and substitution in 
accordance with this invention. 

EXAMPLE 2 

10 Preparing Wildtype p7STNF Receptor and Mutant p7S TNF Receptor 

The following describes processes for preparing a wildtype soluble p75 TNF 
Receptor molecule (extracellular domain of p75 TNF receptor) and three mutant soluble 
TNF receptor molecules. The wildtype soluble p75 TNF Receptor has the nucleotide and 
amino acid sequences described in SEQ ED N0:7 and SEQ ID NO:8. The wildtype and 

15 mutant TNF receptor molecules utilized in the following experiments were the 
extracellular domains without the signal peptide. 

The soluble p75 TNF receptor in the form of a covalently dimerized fusion 
construct of two extracellular, ligand binding portions of the human p75 TNF receptor 
fused together by an IgGlFc moiety (TNFRiFc) (Mohler et al. J. Immunol. 757:1548 - 

20 1561, 1993) was prepared by expressing the protein in CHO cells using the dihydrofolate 
reductase selectable amplifiable marker. Suspension cells were centrifuged and 
resiispended into serum-free medium in a controlled bioreactor. The product was 
collected after 7 days and the TNFR.Fc molecule was purified using protein A affinity 
chromatography followed by an ion-exchange chromatography step. 

25 For each of the three mutant soluble TNF receptor molecules a specific lysine, K, 

was deleted and an arginine, R, was engineered in the same position. More specifically, 
the lysine at position 108 and/or the lysine at position 120 were mutated individually so 
that two single mutants (K108R or K120R) and one double mutant (K108R, K120R) were 
prepared in which the K at position 108 and/or position 120 was replaced by an R at the 

30 same position. SEQ ID N0:1 provides the nucleotide sequence for the K108R mutant 
and SEQ ID N0:2 describes the amino acid sequence encoded by SEQ ID NO: 1 . SEQ ID 
N0:3 provides the nucleotide sequence for the K120R mutant and SEQ ID N0:4 
describes the amino acid sequences encoded by SEQ ID N0:3. SEQ ID NO:5 provides 
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the nucleotide sequence for the K108R, K120R mutant and SEQ ID NO:6 describes the 
amino acid sequences encoded by SEQ ID N0:5. 

Briefly, the mutants were prepared using site directed mutagenesis of K108 and/or 
K120 in the human p75 TNF receptor using PGR mutagenesis of the Sfrl-Notl fragment 
5 of hTNF receptor and Fc fusion protein (hTNFR:Fc). The mutant TNF receptor 

fragments were ligated in frame with a human Fc fragment in the mammalian expression 
vector sf Haveo409. Several of the prepared clones were sequence to confirm that the 
desired nucleic acid changes were incorporated into the mutein nucleotide sequences. 

More particularly, PGR mutagenesis was used to generate mutated 430 base pair 
10 Sal/Sfrl fragments. The PGR mutagenesis procedures utilized wild type TNFR cDNA 
(SEQ ID N0:7) used as the template for the PGR reactions. The oligonucleotide 
sequences used in the PGR reactions to generate the 3 mutant Sall-Srfl DNA fr^agments 
were as follows: 

For the TNF receptor (K108R) mutant the 3' oligonucleotide contained an A to G 

15 substitution at position 389 and a Srfl site at the 3'end. For the TNF receptor (K120R) 
mutant the 3' oligonucleotide contained an A to G substitution at position 425 and a Srfl 
site at the 3'end. For the TNF receptor (K108R,K120R) mutant the oligonucleotide 
contained an A to G substitution at position 389 and 425 and a Srfl site at the 3' end. The 
5' oligonucleotide used to generate the mutant PGR DNA fragments had no nucleotide 

20 changes in the TNFR coding nucleotides and contained the 5' Sail site. 

For the PGR Reactions the Boehringer Mannheim Expand High Fidelity PGR kit 
and reagents were used according to manufacturer's directions. The PGR cycling 
protocol involved the following conditions: 94°G for 2 minutes;94X for 30 seconds; 
SO^'G for 15 seconds, 72°G for 1 minute. 25 cycle reaction. 

25 The DNA fragments generated in the PGR reactions were separated on a 1% 

agarose gel and the 430 base pair TNFR fragments were isolated using GeneGlean reagent 
from BIOIOI. The isolated fragments were restriction digested with Sail and Srfl from 
NEB in Universal Restriction Buffer from Stratagene. The DNA was then repurified 
using the GeneGlean reagents from BIOIOI. 

30 Each of the mutant Sal 1/Srf 1 DNA 430 fragments generated above (and 

corresponding to the 5'end of the TNF receptor) was individually ligated with the 1065 
basepair Srf 1/Notl DNA fragment corresponding to the 3' TNF receptor and human Fc 
cDNA and the 7730 basepair Sall/Notl pDG409 expression. 20ng of tiie pDG409 vector 
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was used for each ligated reaction and the TNF receptor fragments were present at a 3- 
fold higher molar concentration. The ligation reaction was done in Boehringer Mannheim 
ligation mix with 500 units of ligase enzyme at room temperature for 3 hours. 

The ligation reaction mixtures were dialyzed and 1/10 of the reaction mixture was 
5 electroporated into E. coli DHIOB cells. 10 colonies finom each construction were grown 
in liquid culture and the expression vector constructs was confirmed using restriction 
enzyme analysis. Hie TNF receptor cDNA insert in one construct of each of the 3 mutants 
was analyzed by nucleotide sequencing to confirm the desired nucleotide mutations. 

The three mutant fusion cDNA constructs were transfected into CVl/EBNA cells. 

10 The transfected cells were cultured at 37°C for 7 days and then conditioned media firom 
these cells was harvested and monitored for TNFRrFc expression using an Fc EUSA 
assay. The conditioned media was also monitored for TNF receptor bioactivity using an 
A375 cell growth bibassay that is based upon measuring inhibition of TNF activity. The 
three TNFR:Fc mutants and the TNFRrFc wildtype construct demonstrated similar 

1 5 receptor molecule expression levels. 

In order to collect and purify the mutant TNF receptor proteins, supematants from 
the transfected CVl/EBNA cells were collected 7 days post transfection and clarified by 
centrifugation and filtration through a 0.45 |im filter. Purification of the collected and 
filtered wild type protein and the mutant proteins was carried out using protein A affinity 

20 chromatography. A protein A sepharose column was used to capture the Fc portion of the 
fusion proteins. Once bound, the protein was washed with 3 column volumes of 25 mM 
TRlS/140 nvM NaCl at pH7.4 and elated with 3 columns volumes of 50 mM sodium 
acetate/100 mM NaCl at pH 4.0. Each eluted fusion protein was dialyzed against 20 mM 
Na^HPO^ at pH 7.4 and diluted to approximately 1 mg/mL. The final collected products 

25 were purified soluble p75 TI^RrFc mutants as described above. SEQ ID N0:1 provides 
the nucleotide sequence for the K108R mutant and SEQ ID N0:2 describes the amino 
acid sequence encoded by SEQ ID NO:L SEQ ID NO: 3 provides the nucleotide 
sequence for the K120R mutant and SEQ ID N0:4 describes the amino acid sequences 
encoded by SEQ ID N0:3. SEQ ID NO:S provides the nucleotide sequence for the 

30 K108R, K120R mutant and SEQ ID N0:6 describes the amino acid sequences encoded by 
SEQIDN0:5, 
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EXAMPLES 

Conjugating Wildtype and Mutant p75 TNF:Fc Receptors with Polyethylene Glycol 

The following describes a process for preparing polyethylene glycol conjugated 
5 wildtype TNFR:Fc molecules and polyethylene glycol conjugated mutant TNFRiFc 

molecules. For each polyethylene glycol conjugation reaction, a one hundred micrograms 
(lOOjig) portion of wildtype TNFR:Fc, or mutant TNFR:Fc, prepared in Example 2 was 
dissolved in 400 jiL of 50 mM NaJIPO, at pH 8.5 and allowed to react with SPA-PEG 
5000 at different molar ratios of polyethylene glycol to protein (calculated as number of 
10 lysine residues in TNFRiFc) overnight at The molar ratios of protein to lysine 
residues 1 : 1 and 10: 1. SPA-PEG is a 5,000 MW succinimidyl carbonate activated 
monomethoxypolyethylene glycol purchased from Shearwater Polymers, Birmingham, 
AL. The protein and polyethylene glycol solutions were allowed to react overnight at 2- 

15 Each of the polyethylene glycol conjugated TNFRiFc molecules was purified by 

ion exchange chromatography using SP Sepharose Fast Flow resin (Pharmacia) 
equilibrated with 20 mM sodium phosphate. pH 7.4. Polyethylene glycol conjugated 
TNFR:Fc bound to the resin under these conditions. Unreactcd polyethylene glycol and 
reaction byproducts were rinsed from the column with 5 column volumes of the 

20 equilibration buffer. The polyethylene glycol conjugated TNFR:Fc was eluted from the 
column with five column volumes of 20 mM sodium phosphate, 200 mM NaCl, pH 7.4. 
The eluted fractions were pooled and concentrated to approximately 1-5 mg/mL. 

The following indicates the designation given each of the TNFR:Fc molecules 
conjugated with polyethylene glycol (PEG) by the above described procedure: 

25 1. PEG-TNFR:Fc(K108R, K120R); 

2. PEG-TNFR:Fc(K108R); 

3. PEG-TNFR:Fc(K120R); 

4. PEG-TNFR:Fc. 

30 EXAMPLE 4 

Characterization of Coiyugated TNFR:Fc 

The following describes the characterization of polyethylene glycol conjugated 
wildtype polyethylene glycol conjugated mutant TNFRiFc molecules prepared in 
35 Example 3 and a control characterization of unconjugated wildtype and mutant TNFRiFc 



17 



wo 99/67291 



PCT/US99/13953 



molecules prepared in Example 2. The characterization analyses included SDS- 
polyacrylamide gel electrophoresis, size exclusion chromatography, ELISA and in vitro 
bioassay testing. 

SDS-PAGE gradient gels of 4-20% acrylamide (Novex, San Diego) were run with 
5 1 [tg of each polyethylene glycol conjugated mutant TNFRiFc molecule and polyethylene 
glycol conjugated wildtype TNFR:Fc, The gels were stained with Novex fast stain 
according to manufacturer's instructions. The gradient gels showed that the degree of 
polyethylene glycol conjugation was similar for each of the polyethylene glycol 
conjugated mutant TNFR:Fc molecules and the polyethylene glycol conjugated wildtype 

10 TNFRrFc molecule. 

Size exclusion chromatography was performed on each of the molecules 
conjugated with polyethylene glycol as described in Example 3. The size exclusion 
characterization was performed using a Waters HPLC system from Millipore Corp. 
Milford, MA that was equipped with a 300 x 8 mm SEC-400 Biosil column from BioRad. 

IS Sample injection sizes were 50-100 fig and the mobile phase was phosphate ;buffered 
saline at 1 mL/min. The results confirmed that the polyethylene glycol conjugated 
mutants and the polyethylene glycol conjugated wildtype TNFRrFc had substantial 
increases in overall size. More particularly, depending upon the ratio of polyethylene 
glycol to lysine used in the conjugation reaction, the polyethylene glycol conjugated 

20 molecules were 2-3 times larger than the unconjugated molecules. 

The polyethylene glycol conjugated mutant TNFR:Fc molecules, the polyethylene 
glycol conjugated wildtype TNFR:Fc molecule and unconjugated forms of TNFR:Fc 
were subjected to ELISA testing that involved coating 96 well microtiter plates with anti- 
IgGl-Fc monoclonal antibodies, applying the polyethylene glycol modified molecules to 

25 the microtiter plates and allowing them to bind with the anti-IgGl-Fc antibodies. A 

secondary polyclonal anti-TNFR antibody was used to detect the quantity of polyethylene 
glycol conjugated molecules and the quantity of unconjugated TNFRiFc bound to the 
plate. The results of these studies demonstrated that the polyethylene glycol conjugated 
mutant TNFR:Fc and polyethylene conjugated wildtype TNFRiFc reduced or eliminated 

30 binding with anti-IgGl-Fc and /or anti-TNFR antibodies. The results suggest that 
polyethylene glycol conjugation shields epitopes that are active in smtibody binding. 
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Example 5 

Pharmacokinetics of Wildtype and Mutant TNFR:Fc Molecules 
The following describes experiments designed to compare the phamiacokinelics 
of wildtype TNFR:Fc with the polyethylene glycol conjugated TNFRiFc mutant molecule 
5 K108R JC120R (the lysine at 108 and 120 substituted with arginine). The mutant 
molecule had been conjugated with a polyethylene glycol:lysine ration of 10:1. 

Groups of 2 10-12 week old female BALB/c mice were injected intravenously 
with 10 iig of wildtype TNFR:Fc or conjugated mutant TNFR:Fc in a total volume of 
lOOfiL. Following the injection, mice were sacriflced and blood samples were collected 
10 at S minutes, 1 hour, 8 hours, 24 hours, 48 hours and 72 hours via cardiac puncture. 

Plasma samples were analyzed by A37S bioassay. The elimination half lives, tV^, of the 
polyethylene conjugated mutant and the wildtype TNFR:Fc were determined. The half- 
life values are presented as tV4 +/- S.E. were S.R indicates the standard error in fitting the 
log linear line to the data points. The tVi of wildtype TNFRiFc was determined to be 16,5 
15 +/- 1.0 hours and that of the polyethylene glycol mutant was determined to be 36.5 +/-8.5 
hours. 

The results of the above experiments demonstrate that polyethylene glycol 
conjugated TNF receptor prepared in accordance with the present invention has a 
significantly enhanced circulation half life compared to a TNF receptor that is not 
20 polyethylene glycol conjugated. 

Example 6 

Bioactivity of Polyethylene Conjugated Wild type TNFR:Fc and Polyethylene 
Conjugated Mutant TNFR:Fc 

25 

The bioactivities of the polyethylene glycol conjugated TNFR:Fc molecules 
prepared in Example 3 were measured by in vitro A375 bioassays. This assay is generally 
described in Onozaki et al. J, Immunology 135:3962 (1985) and Nakai et al. Biochem, 
Biophys. Res, Comm, 754:1189 (1988). The bioassay is based upon the inhibitory 
30 response of the A375 human malignant melanoma adherent cell line to TNFo. Soluble 
TNFR:Fc can specifically neutralize the inhibitory activity of TNFa in a dose dependent 
manner. To perform the bioassay, 375 cell line (ATCC CRL 1872) was harvested using a 
trypsin-EDTA solution to remove the cell monolayer from flasks. The harvested cells 
were washed with an assay medium of Dulbeccos' Modified Eagles Medium with added 

19 
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fetal bovine senim, non-essential amino acids, and sodium pyruvate (all purchased from 
GIBCO). 

Ninety-six well plates were prepared with serial dilutions of working solutions of 
the polyethylene glycol conjugated mutant TNFRiFc described in Example 3. Then, 
5 equal amounts of TNFa (R&D Systems. Cat. No. #210-CA TF) in the assay medium 
described above were added to wells in 96 well plates followed by adding an equal 
volume of about 4 X lO' harvested cell suspension to each well. 

The plates were placed in a humidity chamber at 3T^C and 10% CO^ and the cells 
were allowed to incubate for 72 hours. Then the plates were removed from the chamber 

10 and the cells were washed with PBS solution, blotted, and fixed with ethyl alcohol. 

Viable cells were made visible by staining the fixed cells with 0.1% aqueous crystal violet 
solution. After washing the plates with water and blotting die cells, 2% sodium 
deoxycholate solution was added to each well and the wells of each plate were read for 
optical density at S70 nm on a plate reader using Delta Soft microplate analysis software. 

IS Standard bioactivity units were assigned for each sample and adjusted to take into account 
the concentration of TNFR:Fc in the wells. Wells containing blanks were assigned a 
bioactivity of zero. 

The results of the A375 bioassays demonstrated the following order of activity for 
the polyethylene glycol conjugated molecules: 

20 

PEG-TNFR:Fc(K108R.K120R,) > PEG-TNFR:Fc(K108R) » PEG-TNFR:Fc(K120R) = 
PEG-TNFR:Fc (PEG =>poIyethylene glycol conjugated) 

The results indicate that the polyethylene glycol conjugated TNFRrFc molecules retain 
23 significant biological activity as determined by in vitro methods. Because the TNFR:Fc 
mutein PEG-TNFR:Fc(108R), in which the lysine at position 108 was mutated to 
arginine, retains much greater activity than the mutein in which the lysine at 120 is 
mutated to arginine, it is suggested the polyethylene glycol conjugated to K108 interferes 
with TNF binding. When this residue is mutated to R108, polyethylene conjugation at the 
30 108 position is prevented and does not significantiy reduce TNF binding activity. 
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What is claimed is: 

.1. A process for conjugating a protein with polyethylene glycol, said process 
5 comprising the steps of 

deleting at least one amino acid residue on the protein, and 
contacting the protein with polyethylene glycol under conditions sufficient 
to conjugate the polyethylene glycol to the protein. 

10 2. The process of claim 1 further including the step of replacing the at least one 
amino acid residue with an amino acid residue that does not react with polyethylene 
glycol. 

3. The process of claim 2 wherein the one or more amino acid residues is associated 
15 with protein binding or protein association. 

4. The process of claim 2 wherein the at least one amino acid residue is lysine. 

5. The process of claim 2 wherein the amino acid residue that does not react with 
20 polyethylene glycol is arginine. 

6. The process of claim 1 wherein the step of contacting the protected protein with 
polyethylene glycol comprises causing an activated polyethylene glycol to react with 
nucleophiles on the protein. 

25 

7. A polyethylene glycol conjugated protein prepared according to the process of 
claim 1. 

8. A process for modifying a protein, said process comprising the steps of 
30 deleting at least one amino acid residue on the protein; 

replacing the at least one amino acid residue with an amino acid residue 
that does not react with polyethylene glycol; and 

contacting the protein with polyethylene glycol under conditions sufficient 
to conjugate the polyethylene glycol to the protein. 

21 
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9. The process of claim 8 wherein the at least one amino acid residue is lysine and 
the amino acid residue that does not react with polyethylene glycol is arginine. 

10. A polyethylene glycol conjugated protein prepared according to the process of 
5 claim 8. 

1 L A process for modifying a protein, said process comprising the steps of 

deleting at least one lysine residue on the protein, the lysine residue being 

associated with binding sites of the protein; 
10 replacing the at least one lysine residue with an arginine; and 

contacting the protein with polyethylene glycol under conditions sufficient 

to conjugate the polyethylene glycol to the protein by reacting the polyethylene glycol 

with lysine residues. 

IS 12. A polyethylene glycol conjugated protein prepared according to the process of 
claim 11. 

13. The process of claim 1 1 wherein the protein is a TNF receptor. 

20 14. A process for modifying a TNFR, said process comprising the steps of 

deleting at least one lysine residue on the TNFR the lysine residue being 
associated with TNF ligand binding sites; 

replacing the at least one lysine residue with an arginine; and 
contacting the TNFR having replaced lysine residues with polyethylene 
25 glycol under conditions sufficient to conjugate the polyethylene glycol to the TNFR by 
reacting the polyethylene glycol with lysine residues. 

15. A polyethylene glycol conjugated soluble TNFR prepared according to the process 
of claim 14. 

30 

16. A soluble TNFR mutant polypeptide having an amino acid sequence selected from 
the group consisting of SEQ ID N0:4, SEQ ID N0:6 and SEQ ID N0:8. 
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17. A DNA encoding mutant soluble TNFR polypeptide having an amino acid 
sequence selected from the group consisting of SEQ ID N0:2, SEQ ID NO:4 and SEQ ID 
NO:6. 

5 18. A mutant polypeptide of claim 16 wherein the polypeptide is polyethylene glycol 
conjugated. 
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(xi) SEQUENCE DESCRIPTION: SEQ ID N0:1: 



TTG 
Leu 
1 


CCC 
Pro 


GCC 
Ala 


CAG 
Gin 


GTG 
Val 
5 


GCA 
Ala 


TTT 
Phe 


ACA CCC TAC 
Thr Pro Tyr 
10 


GCC 
Ala 


CCG 
Pro 


GAG 
Glu 


CCC 
Pro 


GGG 
Gly 
15 


AGC 
Ser 


48 


ACA 
Thr 


TGC 
Cys 


CGG 
Arg 


CTC 
Leu 
20 


AGA 
Arg 


GAA 
Glu 


TAC 
Tyr 


TAT GAC CAG 
Tyr Asp Gin 
25 


ACA 
Thr 


GCT 
Ala 


CAG 
Gin 


ATG 
Met 
30 


TGC 
Cys 


TGC 
Cys 


96 


AGC 
Ser 


AAA 
Lys 


TGC 
Cys 
35 


TCG 
Ser 


CCG 
Pro 


GGC 
Gly 


CAA 
Gin 


CAT GCA AAA 
His Ala Lys 
40 


GTC 
Val 


TTC 
Phe 


TGT 
Cys 
45 


ACC 
Thr 


AAG 
Lys 


ACC 
Thr 


144 


TCG 
Ser 


GAC 
Asp 
50 


ACC 
Thr 


GTG 
val 


TGT 
Cys 


GAC 
Asp 


TCC 
Ser 
55 


TGT GAG GAC 
Cys Glu Asp 


AGC 
Ser 


ACA 
Thr 
60 


TAC 
Tyr 


ACC 
Thr 


CAG 
Gin 


CTC 
Leu 


192 


TOG 
Trp 
65 


AAC 
Asn 


TGG 
Trp 


GTT 
Val 


CCC 
Pro 


GAG 
Glu 
70 


TGC 
Cys 


TTG AGC TGT 
Leu Ser Cys 


GGC 
Gly 
75 


TCC 
Ser 


CGC 
Arg 


TGT 
Cys 


AGC 
Ser 


TCT 
Ser 
80 


240 


GAC 
Asp 


CAG 
Gin 


GTG 
Val 


GAA 
Glu 


ACT 
Thr 
85 


CAA 
Gin 


GCC 
Ala 


TGC ACT CGG 
Cys Thr Arg 
90 


GAA 
Glu 


CAG 
Gin 


AAC 
Asn 


CGC 
Arg 


ATC 
He 
95 


TGC 
Cys 


288 


ACC 
Thr 


TGC 
Cys 


AGG 
Arg 


CCC 
Pro 
100 


GGC 
Gly 


TGG 
Trp 


TAC 
Tyr 


TGC GCG CTG 
Cys Ala Leu 
105 


AGC 
Ser 


AGG 
Arg 


CAG 
Gin 


GAG 
Glu 
110 


GGG 
Gly 


TGC 
Cys 


336 


CGG 
Arg 


CTG 
Leu 


TGC 
Cys 
115 


GCG 
Ala 


CCG 
Pro 


CTG 
Leu 


CGC 
Arg 


AAG TGC CGC 
Lys Cys Arg 
120 


CCG 
Pro 


GGC 
Gly 


TTC 
Phe 
125 


GGC 
Gly 


GTG 
Val 


GCC 
Ala 


384 


AGA 
Arg 


CCA 
Pro 
130 


GGA 
Gly 


ACT 
Thr 


GAA 
Glu 


ACA 
Thr 


TCA 
Ser 
135 


GAC GTG GTG 
Asp Val Val 


TGC 
Cys 


AAG 
Lys 
140 


CCC 
Pro 


TGT 
Cys 


GCC 
Ala 


CCG 
Pro 


432 


GGG 
Gly 
145 


ACG 
Thr 


TTC 
Phe 


TCC 
Ser 


AAC 
Asn 


ACG 
Thr 
150 


ACT 
Thr 


TCA TCC ACG 
Ser Ser Thr 


GAT 
Asp 
155 


ATT 
He 


TGC 
Cys 


AGG 
Arg 


CCC 
Pro 


CAC 
His 
160 


480 


CAG 
Gin 


ATC 
He 


TGT 
Cys 


AAC 
Asn 


GTG 
Val 
165 


GTG 
Val 


GCC 
Ala 


ATC CCT GGG 
He Pro Gly 
170 


AAT 
Asn 


GCA 
Ala 


AGC 
Ser 


ATG 
Met 


GAT 
Asp 
175 


GCA 
Ala 


528 


GTC 
Val 


TGC 
Cys 


ACG 
Thr 


TCC 
Ser 
180 


ACG 
Thr 


TCC 
Ser 


CCC 
Pro 


ACC CGG AGT 
Thr Arg Ser 
185 


ATG 
Met 


GCC 
Ala 


CCA 
Pro 


GGG 
Gly 
190 


GCA 
Ala 


GTA 
Val 


576 


CAC 
His 


TTA 
Leu 


CCC 
Pro 
195 


CAG 
Gin 


CCA 
Pro 


GTG 
Val 


TCC 
Ser 


ACA CGA TCC 
Thr Arg Ser 
200 


CAA 
Gin 


CAC 
His 


ACG 
Thr 
205 


CAG 
Gin 


CCA 
Pro 


ACT 
Thr 


624 


CCA 
Pro 


GAA 
Glu 
210 


CCC 
Pro 


AGC 
Ser 


ACT 
Thr 


GCT 
Ala 


CCA 
Pro 
215 


AGC ACC TCC 
Ser Thr Ser 


TTC 
Phe 


CTG 
Leu 
220 


CTC 
Leu 


CCA 
Pro 


ATG 
Met 


GGC 
Gly 


672 


CCC 
Pro 
225 


AGC 
Ser 


CCC 
Pro 


CCA 
Pro 


GCT 
Ala 


GAA 
Glu 
230 


GGG 
Gly 


AGC ACT GGC 
Ser Thr Gly 


GAC 
Asp 
235 












705 
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(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 235 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:2: 

Leu Pro Ala Gin Val Ala Phe Thr Pro Tyr Ala Pro Glu Pro Gly Ser 
15 10 15 

Thr Cys Arg Leu Arg Glu Tyr Tyr Asp Gin Thr Ala Gin Met Cys Cys 
20 25 30 

Ser Lys Cys Ser Pro Gly Gin His Ala Lys Val Phe Cys Thr Lys Thr 
35 40 45 

Ser Asp Thr Val Cys Asp Ser Cys Glu Asp Ser Thr Tyr Thr Gin Leu 
50- 55 60 

Trp Asn Trp Val Pro Glu Cys Leu Ser Cys Gly Ser Arg Cys Ser Ser 
65 70 75 80 

Asp Gin Val Glu Thr Gin Ala Cys Thr Arg Glu Gin Asn Arg lie Cys 
85 90 95 

Thr Cys Arg Pro Gly Trp Tyr Cys Ala Leu Ser Arg Gin Glu Gly Cys 
100 105 110 

Arg Leu Cys Ala Pro Leu Arg Lys Cys Arg Pro Gly Phe Gly Val Ala 
115 120 125 

Arg Pro Gly Thr Glu Thr Ser Asp Val Val Cys Lys Pro Cys Ala Pro 
130 135 140 

Gly Thr Phe Ser Asn Thr Thr Ser Ser Thr Asp lie Cys Arg Pro His 
145 150 155 160 

Gin lie Cys Asn Val Val Ala He Pro Gly Asn Ala Ser Met Asp Ala 
165 170 175 

Val Cys Thr Ser Thr Ser Pro Thr Arg Ser Met Ala Pro Gly Ala Val 
180 185 190 

His Leu Pro Gin Pro Val Ser Thr Arg Ser Gin His Thr Gin Pro Thr 
195 200 205 

Pro Glu Pro Ser Thr Ala Pro Ser Thr Ser Phe Leu Leu Pro Met Gly 
210 215 220 

Pro Ser Pro Pro Ala Glu Gly Ser Thr Gly Asp 
225 230 235 

(2) INFORMATION FOR SEQ ID N0:3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7 05 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: DNA (genomic) 
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(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 
(v) FRAGMENT TYPE: N- terminal 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1. .705 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

TTG CCC GCC CAG GTG GCA TTT AC A CCC TAC GCC CCG GAG CCC GGG AGC 48 
Leu Pro Ala Gin Val Ala Phe Thr Pro Tyr Ala Pro Glu Pro Gly Ser 
1 5 10 15 

ACA TGC CGG CTC AGA GAA TAC TAT GAC CAG ACA GCT CAG ATG TGC TGC 96 
Thr Cys Arg Leu Arg Glu Tyr Tyr Asp Gin Thr Ala Gin Met Cys Cys 
20 25 30 

AGC AAA TGC TCG CCG GGC CAA CAT GCA AAA GTC TTC TGT ACC AAG ACC 144 
Ser Lys Cys Ser Pro Gly Gin His Ala Lys Val Phe Cys Thr Lys Thr 
35 40 45 

TCG GAC ACC GTG TGT GAC TCC TGT GAG GAC AGC ACA TAC ACC CAG CTC 192 
Ser Asp Thr Val Cys Asp Ser Cys Glu Asp Ser Thr Tyr Thr Gin Leu 
50 55 60 

TGG AAC TGG GTT CCC GAG TGC TTG AGC TGT GGC TCC CGC TGT AGC TCT 240 
Trp Asn Trp Val Pro Glu Cys Leu Ser Cys Gly Ser Arg Cys Ser Ser 
65 70 75 80 

GAC CAG GTG GAA ACT CAA GCC TGC ACT CGG GAA CAG AAC CGC ATC TGC 288 
Asp Gin Val Glu Thr Gin Ala Cys Thr Arg Glu Gin Asn Arg He Cys 
85 90 95 

ACC TGC AGG CCC GGC TGG TAC TGC GCG CTG AGC AAG CAG GAG GGG TGC 336 
Thr Cys Arg Pro Gly Trp Tyr Cys Ala Leu Ser Lys Gin Glu Gly Cys 
100 105 110 

CGG CTG TGC GCG CCG CTG CGC AGG TGC CGC CCG GGC TTC GGC GTG GCC 384 
Arg Leu Cys Ala Pro Leu Arg Arg Cys Arg Pro Gly Phe Gly Val Ala 
115 120 125 

AGA CCA GGA ACT GAA ACA TCA GAC GTG GTG TGC AAG CCC TGT GCC CCG 432 
Arg Pro Gly Thr Glu Thr Ser Asp Val Val Cys Lys Pro Cys Ala Pro 
130 135 140 

GGG ACG TTC TCC AAC ACG ACT TCA TCC ACG GAT ATT TGC AGG CCC CAC 480 
Gly Thr Phe Ser Asn Thr Thr Ser Ser Thr Asp He Cys Arg Pro His 
145 150 155 160 

CAG ATC TGT AAC GTG GTG GCC ATC CCT GGG AAT GCA AGC ATG GAT GCA 528 
Gin He Cys Asn Val Val Ala He Pro Gly Asn Ala Ser Met Asp Ala 
165 170 175 

GTC TGC ACG TCC ACG TCC CCC ACC CGG AGT ATG GCC CCA GGG GCA GTA 576 
Val Cys Thr Ser Thr Ser Pro Thr Arg Ser Met Ala Pro Gly Ala Val 
180 185 190 

CAC TTA CCC CAG CCA GTG TCC ACA CGA TCC CAA CAC ACG CAG CCA ACT 624 
His Leu Pro Gin Pro Val Ser Thr Arg Ser Gin His Thr Gin Pro Thr 
195 200 205 
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CCA GAA CCC AGC ACT GCT CCA AGC ACC TCC TTC CTG CTC CCA ATG GGC 672 
Pro Glu Pro Ser Thr Ala Pro Ser Thr Ser Phe Leu Leu Pro Met Gly 
210 215 220 

CCC AGC CCC CCA GCT GAA GGG AGC ACT GGC GAC 705 
Pro Ser Pro Pro Ala Glu Gly Ser Thr Gly Asp 
225 230 235 

(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 235 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

Leu Pro Ala Gin Val Ala Phe Thr Pro Tyr Ala Pro Glu Pro Gly Ser 
15 10 15 

Thr Cys Arg Leu Arg Glu Tyr Tyr Asp Gin Thr Ala Gin Met Cys Cys 
20 25 30 

Ser Lys Cys Ser Pro Gly Gin His Ala Lys Val Phe Cys Thr Lys Thr 
35 40 45 

Ser Asp Thr Val Cys Asp Ser Cys Glu Asp Ser Thr Tyr Thr Gin Leu 
50 55 60 

Trp Asn Trp Val Pro Glu Cys Leu Ser Cys Gly Ser Arg Cys Ser Ser 
65 70 75 80 

Asp Gin Val Glu Thr Gin Ala Cys Thr Arg Glu Gin Asn Arg lie Cys 
85 90 95 

Thr Cys Arg Pro Gly Trp Tyr Cys Ala Leu Ser Lys Gin Glu Gly Cys 
100 105 110 

Arg Leu Cys Ala Pro Leu Arg Arg Cys Arg Pro Gly Phe Gly Val Ala 
115 120 125 

Arg Pro Gly Thr Glu Thr Ser Asp Val Val Cys Lys Pro Cys Ala Pro 
130 135 140 

Gly Thr Phe Ser Asn Thr Thr Ser Ser Thr Asp lie Cys Arg Pro His 
145 150 155 160 

Gin He Cys Asn Val Val Ala He Pro Gly Asn Ala Ser Met top Ala 
165 170 175 

Val Cys Thr Ser Thr Ser Pro Thr Arg Ser Met Ala Pro Gly Ala Val 
180 185 190 

His Leu Pro Gin Pro Val Ser Thr Arg Ser Gin His Thr Gin Pro Thr 
195 200 205 

Pro Glu Pro Ser Thr Ala Pro Ser Thr Ser Phe Leu Leu Pro Met Gly 
210 215 220 

Pro Ser Pro Pro Ala Glu Gly Ser Thr Gly Asp 
225 230 235 



5 



wo 99/67291 



PCT/US99/13953 



(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 705 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: doxible 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 
(v) FRAGMENT TYPE: N-terminal 



(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..705 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

TTG CCC GCC CAG GTG GCA TTT ACA CCC TAC GCC CCG GAG CCC GGG AGC 48 
Leu Pro Ala Gin Val Ala Phe Thr Pro Tyr Ala Pro Glu Pro Gly Ser 
1 5 10 15 

ACA TGC CGG CTC AGA GAA TAC TAT GAC CAG ACA GCT CAG ATG TGC TGC 96 
Thr Cys Arg Leu Arg Glu Tyr Tyr Asp Gin Thr Ala Gin Met Cys Cys 
20 25 30 

AGC AAA TGC TCG CCG GGC CAA CAT GCA AAA GTC TTC TGT ACC AAG ACC 144 
Ser Lys Cys Ser Pro Gly Gin His Ala Lys Val Phe Cys Thr Lys Thr 
35 40 45 

TCG GAC ACC GTG TGT GAC TCC TGT GAG GAC AGC ACA TAC ACC CAG CTC 192 
Ser Asp Thr Val Cys Asp Ser Cys Glu Asp Ser Thr Tyr Thr Gin Leu 
50 55 60 

TGG AAC TGG GTT CCC GAG TGC TTG AGC TGT GGC TCC CGC TGT AGC TCT 240 
Trp Asn Trp Val Pro Glu Cys Leu Ser Cys Gly Ser Arg Cys Ser Ser 
65 70 75 80 

GAC CAG GTG GAA ACT CAA GCC TGC ACT CGG GAA CAG AAC CGC ATC TGC 288 
Asp Gin Val Glu Thr Gin Ala Cys Thr Arg Glu Gin Asn Arg He Cys 
85 90 95 

ACC TGC AGG CCC GGC TGG TAC TGC GCG CTG AGC AGG CAG GAG GGG TGC 336 
Thr Cys Arg Pro Gly Trp Tyr Cys Ala Leu Ser Arg Gin Glu Gly Cys 
100 105 110 

CGG CTG TGC GCG CCG CTG CGC AGG TGC CGC CCG GGC TTC GGC GTG GCC 384 
Arg Leu Cys Ala Pro Leu Arg Arg Cys Arg Pro Gly Phe Gly Val Ala 
115 120 125 

AGA CCA GGA ACT. GAA ACA TCA GAC GTG GTG TGC AAG CCC TGT GCC CCG 432 
Arg Pro Gly Thr Glu Thr Ser Asp Val Val Cys Lys Pro Cys Ala Pro 
130 135 140 

GGG ACG TTC TCC AAC ACG ACT TCA TCC ACG GAT ATT TGC AGG CCC CAC 480 
Gly Thr Phe Ser Asn Thr Thr Ser Ser Thr Asp He Cys Arg Pro His 
145 150 155 160 

CAG ATC TGT AAC GTG GTG GCC ATC CCT GGG AAT GCA AGC ATG GAT GCA 528 
Gin He Cys Asn Val Val Ala He Pro Gly Asn Ala Ser Met Asp Ala 
165 170 175 
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GTC TGC ACG TCC ACG TCC CCC ACC CGG AGT ATG GCC CCA GGG GCA GTA 576 
Val Cys Thr Ser Thr Ser Pro Thr Arg Ser Met Ala Pro Gly Ala Val 
180 185 190 . 

CAC TTA CCC CAG CCA GTG TCC ACA CGA TCC CAA CAC ACG CAG CCA ACT 624 
His Leu Pro Gin Pro Val Ser Thr Arg Ser Gin His Thr Gin Pro Thr 
195 200 205 

CCA GAA CCC AGC ACT GCT CCA AGC ACC TCC TTC CTG CTC CCA ATG GGC 672 
Pro Glu Pro Ser Thr Ala Pro Ser Thr Ser Phe Leu Leu Pro Met Gly 
210 215 220 

CCC AGC CCC CCA GCT GAA GGG AGC ACT GGC GAC 705 
Pro Ser Pro Pro Ala Glu Gly Ser Thr Gly Asp 
225 230 235 

(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 235 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

Leu Pro Ala Gin Val Ala Phe Thr Pro Tyr Ala Pro Glu Pro Gly Ser 
15 10 15 

Thr Cys Arg Leu Arg Glu Tyr Tyr Asp Gin Thr Ala Gin Met Cys Cys 
20 25 30 

Ser Lys Cys Ser Pro Gly Gin His Ala Lys Val Phe Cys Thr Lys Thr 
35 40 45 

Ser Asp Thr Val Cys Asp Ser Cys Glu Asp Ser Thr Tyr Thr Gin Leu 

50 55 60 

Trp Asn Trp Val Pro Glu Cys Leu Ser Cys Gly Ser Arg Cys Ser Ser 
65 70 75 80 

Asp Gin Val Glu Thr Gin Ala Cys Thr Arg Glu Gin Asn Arg lie Cys 
85 90 95 

Thr Cys Arg Pro Gly Trp Tyr Cys Ala Leu Ser Arg Gin Glu Gly Cys 
100 105 110 

Arg Leu Cys Ala Pro Leu Arg Arg Cys Arg Pro Gly Phe Gly Val Ala 
115 120 125 

Arg Pro Gly Thr Glu Thr Ser Asp Val Val Cys Lys Pro Cys Ala Pro 

130 135 140 

Gly Thr Phe Ser Asn Thr Thr Ser Ser Thr Asp lie Cys Arg Pro His 
145 150 155 160 

Gin He Cys Asn Val Val Ala He Pro Gly Asn Ala Ser Met Asp Ala 
165 170 175 

Val Cys Thr Ser Thr Ser Pro Thr Arg Ser Met Ala Pro Gly Ala Val 
180 185 190 

His Leu Pro Gin Pro Val Ser Thr Arg Ser Gin His Thr Gin Pro Thr 
195 200 205 
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Pro Glu Pro Ser Thr Ala Pro Ser Thr Ser Phe Leu Leu Pro Met Gly 
210 215 220 

Pro Ser Pro Pro Ala Glu Gly Ser Thr Gly Asp 
225 230 235 



(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 705 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI -SENSE: NO 

(V) FRAGMENT TYPE: N- terminal 



(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..705 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 

TTG CCC GCC CAG GTG GCA TTT ACA CCC TAC GCC CCG GAG CCC GGG AGC 48- 
Leu Pro Ala Gin Val Ala Phe Thr Pro Tyr Ala Pro Glu Pro Gly Ser 
1 5 ' 10 15 

ACA TGC CGG CTC AGA GAA TAC TAT GAC CAG ACA GCT CAG ATG TGC TGC 96 
Thr Cys Arg Leu Arg Glu Tyr Tyr Asp Gin Thr Ala Gin Met Cys Cys 
20 25 30 

AGC AAA TGC TCG CCG GGC CAA CAT GCA AAA GTC TTC TGT ACC AAG ACC 144 
Ser Lys Cys Ser Pro Gly Gin His Ala Lys Val Phe Cys Thr Lys Thr 
35 40 45 

TCG GAC ACC GTG TGT GAC TCC TGT GAG GAC AGC ACA TAC ACC CAG CTC 192 
Ser Asp Thr Val Cys Asp Ser Cys Glu Asp Ser Thr Tyr Thr Gin Leu 
50 55 60 

TGG AAC TGG GTT CCC GAG TGC TTG AGC TGT GGC TCC CGC TGT AGC TCT 240 
Trp Asn Trp Val Pro Glu Cys Leu Ser Cys Gly Ser Arg Cys Ser Ser 
65 70 75 80 

GAC CAG GTG GAA ACT CAA GCC TGC ACT CGG GAA CAG AAC CGC ATC TGC 288 
Asp Gin Val Glu Thr Gin Ala Cys Thr Arg Glu Gin Asn Arg lie Cys 
85 90 95 

ACC TGC AGG CCC GGC TGG TAC TGC GCG CTG AGC AAG CAG GAG GGG TGC 336 
Thr Cys Arg Pro Gly Trp Tyr Cys Ala Leu Ser Lys Gin Glu Gly Cys 

100 105 110 

CGG CTG TGC GCG CCG CTG CGC AAG TGC CGC CCG GGC TTC GGC GTG GCC 384 
Arg Leu Cys Ala Pro Leu Arg Lys Cys Arg Pro Gly Phe Gly Val Ala 
115 120 125 

AGA CCA GGA ACT GAA ACA TCA GAC GTG GTG TGC AAG CCC TGT GCC CCG 432 
Arg Pro Gly Thr Glu Thr Ser Asp Val Val Cys Lys Pro Cys Ala Pro 
130 135 140 
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GGG ACG TTC TCC AAC ACG ACT TCA TCC ACG GAT ATT TGC AGG CCC CAC 480 
Gly Thr Phe Ser Asn Thr Thr Ser Ser Thr Asp lie Cys Arg Pro His 
145 150 155 160 

CAG ATC TGT AAC GTG GTG GCC ATC CCT GGG AAT GCA AGC ATG GAT GCA 528 
Gin He Cys Asn Val Val Ala He Pro Gly Asn Ala Ser Met Asp Ala 
165 170 175 

GTC TGC ACG TCC ACG TCC CCC ACC CGG AGT ATG GCC CCA GGG GCA GTA 576 
Val Cys Thr Ser Thr Ser Pro Thr Arg Ser Met Ala Pro Gly Ala Val 
180 185 190 

CAC TTA CCC CAG CCA GTG TCC ACA CGA TCC CAA CAC ACG CAG CCA ACT 624 
His Leu Pro Gin Pro Val Ser Thr Arg Ser Gin His Thr Gin Pro Thr 
195 200 205 

CCA GAA CCC AGC ACT GCT CCA AGC ACC TCC TTC CTG CTC CCA ATG GGC 672 
Pro Glu Pro Ser Thr Ala Pro Ser Thr Ser Phe Leu Leu Pro Met Gly 
210 215 220 

CCC AGC CCC CCA GCT GAA GGG AGC ACT GGC GAC 705 
Pro Ser Pro Pro Ala Glu Gly Ser Thr Gly Asp 
225 230 235 

(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH; 235 ainino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 

Leu Pro Ala Gin Val Ala Phe Thr Pro Tyr Ala Pro Glu Pro Gly Ser 
15 10 15 

Thr Cys Arg Leu Arg Glu Tyr Tyr Asp Gin Thr Ala Gin Met Cys Cys 

20 25 30 

Ser Lys Cys Ser Pro Gly Gin His Ala Lys Val Phe Cys Thr Lys Thr 
35 40 45 

Ser Asp Thr Val Cys Asp Ser Cys Glu Asp Ser Thr Tyr Thr Gin Leu 
50 55 60 

Trp Asn Trp Val Pro Glu Cys Leu Ser Cys Gly Ser Arg Cys Ser Ser 
65 70 75 80 

Asp Gin Val Glu Thr Gin Ala Cys Thr Arg Glu Gin Asn Arg He Cys 
85 90 95 

Thr Cys Arg Pro Gly Trp Tyr Cys Ala Leu Ser Lys Gin Glu Gly Cys 
100 105 110 

Arg Leu Cys Ala Pro Leu Arg Lys Cys Arg Pro Gly Phe Gly Val Ala 
115 120 125 

Arg Pro Gly Thr Glu Thr Ser Asp Val Val Cys Lys Pro Cys Ala Pro 
130 135 140 

Gly Thr Phe Ser Asn Thr Thr Ser Ser Thr Asp He Cys Arg Pro His 
145 150 155 160 
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Gin lie Cys Asn Val Val Ala He Pro Gly Asn Ala Ser Met Asp Ala 
165 170 175 

Val Cys Thr Ser Thr Ser Pro Thr Arg Ser Met Ala Pro Gly Ala Val 
180 185 190 

His Leu Pro Gin Pro Val Ser Thr Arg Ser Gin His Thr Gin Pro Thr 
195 200 205 

Pro Glu Pro Ser Thr Ala Pro Ser Thr Ser Phe Leu Leu Pro Met Gly 
210 215 220 

Pro Ser Pro Pro Ala Glu Gly Ser Thr Gly Asp 
225 230 235 
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